Attorney Docket No. 43869.015300 

REMARKS 

The disclosure is objected to because the Examiner states that the mathematical 
formulas are difficult to decipher. The presently attached substitute specification is 
believed to be clear. 

The Specification which was submitted in the March 24, 2004 amendment and 
the presently submitted substitute specification include no new matter. The only 
changes to the specification were the proper titles and paragraph numbering. 

Claims 1-3, 5-11 and 15-23 remain rejected under 35 U.S.C. 103(a) as being 
unpatentable over Agrawal et al. 

Applicant respectfully urges reconsideration. 

Claim 13 has been combined with claim 1. Since claim 13 was indicated to be 
allowable, applicants believe all the claims are now in condition for allowance. 

Reconsideration of this rejection is requested. 

It is believed that all of the present claims are in condition for allowance. Early 
and favorable action by the Examiner is earnestly solicited. 



AUTHORIZATION 



If the Examiner believes that issues may be resolved by telephone 
interview, the Examiner is respectfully urged to telephone the undersigned at (212) 801-2146. 
The undersigned may also be contacted by e-mail at ecr@gtlaw.com. 

No additional fee is believed to be necessary. The Commissioner is 
hereby authorized to charge any additional fees which may be required for this 
amendment, or credit any overpayment to Deposit Account No. 50-1561. 

In the event that an extension of time is required, or which may be 
required in addition to that requested in a petition for an extension of time, the 
Commissioner is requested to grant a petition for that extension of time which is 
required to make this response timely and is hereby authorized to charge any fee for 
such an extension of time or credit any overpayment for an extension of time to Deposit 
Account No. 50-1561. 

Respectfully submitted, 



Dated: August^, 2004 




Eugene C. Rzucidlo 
Registration No. 31,900 
Customer Number: 32361 
PROCESS FOR STORING TEXT AND 
PROCEDURE FOR SEARCHING STORED TEXTS 
FOR THOSE PERTINENT TO A QUESTION 



BACKGROUND OF THE INVENTION 

[0001] With the modern word processing methods, of rare permanence, the world of 
documentation has recently experienced substantial expansion. As the requirements or desire 
for knowledge on the part of individuals increase, the information itself is also increasing, 
perhaps even more so. The number of papers, reviews, journals and other publications of all 
kinds, even on a particular subject, is also continuing to expand. The storage or filing of data 
has become a difficult task. Conversely, the retrieval of data from a stored batch is no easier 
today. 

[0002] The key-word solution to this twofold problem is well known. Given the size 
of data banks, this is a solution that is often no longer appropriate, since querying a key word 
produces both too many and not enough documents as a result of the failure to take into 
account both homonymy (non-pertinent documents) and synonymy. 

[0003] Analysis and search, now microscopic, need to become macroscopic and that 
is what the applicant is seeking to offer here. Documentalists and archivists have to move 
from words to concepts, ideas, in other words, to the plurality, the combination and the 
association of words. 

SUMMARY OF THE INVENTION 

[0004] The invention covers the process for the analysis and storage-filing of texts as 
well as the search and retrieval of stored texts. In short, the invention seeks to offer tools for 
improving and organizing knowledge. 
[0005] The invention covers first of all a process for storing a text according to 
which: 

- a word dictionary is created in a multidimensional conceptual reference point, 

- each conceptual word from at least a portion of the text to be stored is 
compared to the dictionary words to determine the position of this word in said 
reference point, and 

- the resultant of the positions of all the conceptual words of the portion of text 
to be stored is determined in order to identify the position of a global 
conceptualization of the portion of text in said reference point and to store this 
position. 

[0006] The term "word" must naturally include the linguistic unit, that is to say the 
word in the proper sense of the term, but also the group of words that form a unitary semantic 
expression, such as, for example, "heart attack". 

[0007] The axes of the reference point according to the invention, equal in number to 
the dimensions, correspond to the different concepts expressed in the dictionary. 

[0008] A word, in the process according to the invention, is defined by a point or by a 
vector that extends from the origin of the reference mark to this point, whose coordinates, on 
the axes of the reference point, correspond respectively to the relative weight of the different 
concepts attached to this word. 
[0009] Finally, the storage procedure according to the invention consists in 
vectorizing the words of a text and calculating their conceptual resultant which is 
representative of the entire text in a reference of a plurality of concepts. 

[0010] Advantageously, to determine the resultant of the positions in the reference of 
all the conceptual words of the portion of text to be stored, each word position in the 
reference is first matched to its position in the text and its syntactic role. 

[0011] Also advantageously, in order to determine the resultant of the positions of the 
conceptual words of the portion of text to be stored, these positions are multiplexed by a 
composition algorithm. 

[0012] The invention also covers a process for searching among a plurality of texts 
stored according to the above-cited procedure for those that deal with a particular question, in 
which: 

- as for text storage, the position in the multidimensional conceptual reference 
of a global conceptualization of the question is obtained by determining the resultant of 
the positions of all the conceptual words of the question, and 

- the position of the global conceptualization of the question is compared to the 
homologous positions of the stored texts in order to select at least one of them, 
corresponding to a searched text. 

[0013] Advantageously, the position of the global conceptualization of the question 
is compared to those of the stored texts by determining, for each text, a distance between the two 
respective positions of the question and of the text. 

[0014] Preferably, the distance determined between two positions is non-Euclidean. 

DETAILED DESCRIPTION OF THE INVENTION 

[0015] The invention will be more fully understood from the following description of 
different forms of embodiment of the process for the storing of texts and the procedure for 
searching among stored texts for those that deal with a given question, with reference to the 
single annexed figure which represents a multidimensional conceptual reference point. 



[0016] For the sake of clarity, and in order to create a better understanding of the 
invention, the example that will now be described is an instructional example, an extremely 
simplified textbook case. 

[0017] The text storage procedure will first be set forth in detail. 

1 - Text storage procedure 

1.1 - Creation of a word dictionary 

[0018] First of all, it is recalled that the term "word" is intended to designate a 
linguistic unit, that is, both a word in the proper sense of the term and a group of words 
forming a unitary semantic expression such as, for example, "heart attack", "identity card", 
"secondary sector", etc. 

[0019] Let us posit a vectorial space of dimension n, n being a natural number 
greater than one, to which is attached a conceptual reference point ℜ, a scalar product and an 
associated norm. The reference point ℜ is made orthonormal. The term orthonormal 
reference is intended to designate a base of n vectors that are mutually orthogonal (for the 
defined scalar product) and each of norm equal to one (for the defined norm). By definition, the 
vectors of the base are vectors from which every vector of the vectorial space can be defined 
by linear combination. 

[0020] In the instructional example of the description, the vectorial space is 
three-dimensional and provided with a Euclidean scalar product and the associated Euclidean 
norm, as well as a conceptual reference point ℜ, represented on the figure, including three 
main lines A1, A2, A3 carrying base vectors u1, u2, u3 respectively, whose respective 
coordinates in the reference point ℜ are (1, 0, 0), (0, 1, 0) and (0, 0, 1). 

[0021] First of all, it will be noted that a position in the reference point ℜ is defined 
by a triplet of coordinates along axes A1, A2, A3 respectively, and that for each position 
in the reference there is a corresponding vector with the same coordinates, extending from 
an origin O of the reference point ℜ. Hereafter, the terms "position" and "vector" will 
therefore be used interchangeably. 

[0022] By definition, the Euclidean scalar product of two vectors X and Y is equal 
to the sum of the products of the homologous coordinates of vectors X and Y. The 
mathematical formula for calculating the Euclidean scalar product is therefore as follows: 

$$\langle X, Y \rangle = \sum_{i=1}^{n} x_i y_i$$

in which 

- $\langle X, Y \rangle$ represents the scalar product of X and Y, and 

- $x_i$ and $y_i$ represent the respective coordinates of vector X and of vector Y along 
the index i axis, 

with n representing the dimension of the vectorial space, equal to three in the example 
of the description. 

[0023] The Euclidean norm $\|X\|$ of vector X is defined by the following formula: 

$$\|X\| = \sqrt{\langle X, X \rangle} = \sqrt{\sum_{i=1}^{n} x_i^2}$$
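
As an illustration of the two definitions above, the following minimal sketch computes the Euclidean scalar product and norm for coordinate tuples. The function names `scalar_product` and `norm` are chosen here for illustration and do not appear in the specification.

```python
import math


def scalar_product(x, y):
    """Euclidean scalar product: sum of the products of homologous coordinates."""
    if len(x) != len(y):
        raise ValueError("vectors must have the same dimension")
    return sum(xi * yi for xi, yi in zip(x, y))


def norm(x):
    """Euclidean norm associated with the scalar product above."""
    return math.sqrt(scalar_product(x, x))


# Base vectors of the three-dimensional example.
u1, u2, u3 = (1, 0, 0), (0, 1, 0), (0, 0, 1)

# The base is orthonormal: distinct vectors are orthogonal, each has norm one.
print(scalar_product(u1, u2), norm(u1))
```

The same two helpers are sufficient for every Euclidean calculation in the instructional example.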



[0024] The unit of each axis corresponds to a concept, an idea expressed in the 
dictionary. In the case in point: 

- the unit of axis A1 corresponds to the concept of physics, 

- the unit of axis A2 corresponds to the concept of the liquid state, and 

- the unit of axis A3 corresponds to the concept of printing. 

[0025] Physics, the liquid state and printing are therefore the three concepts of the 
conceptual reference ℜ corresponding to the three dimensions of the reference point ℜ. 

[0026] In order to create the word dictionary, the conceptual words are taken from 
among the words in the language, and the position of each of these words in the conceptual 
reference ℜ is determined. 

[0027] The term "conceptual word" means an important word in the text, loaded with 
meaning, expressing one or more ideas, and contributing therefore in a major way to giving 
the text its overall meaning. In short, a conceptual word is a word that can make reference to 
at least one concept of the conceptual reference. 

[0028] For the sake of clarity, a dictionary is created here containing only the words 
necessary to an understanding of the particular example of the description, to wit the 
following words: body, plunge, liquid, undergo, thrust, vertical, police, think, drowning, 
style, fluid, idea, miss, mechanics. 

[0029] It is clear that a word can have a number of meanings and it is generally 
possible to determine the sense in which this word is being employed in a text, in terms of the 
context of the text. 

[0030] In order to introduce each of these words into the dictionary, all the possible 
meanings of the word are searched, all the concepts relative to the reference point ℜ to which 
this word can possibly make reference are deduced, and, in terms of these concepts, a 
position is assigned in the conceptual reference ℜ. The coordinates of the position of each 
word correspond to the relative weights of the various concepts attached to this word. In the 
dictionary, each of the words is associated with a position represented by a triplet of 
coordinates in the reference ℜ. 

[0031] To illustrate this step in the creation of the dictionary, let us specify in greater 
detail the introduction of certain particular words into the dictionary. 

[0032] Let us first of all take the word "body". According to the dictionary "Le Petit 
Robert" (Le Robert dictionaries edition, 1993), the term body can designate "any material 
body characterized by its physical properties", and "the body of a letter" refers to the 
"dimension of a print character". From this, one can deduce that the word "body" can, 
depending on its utilization, refer either to the concept of physics or to the concept of 
printing. On the other hand, in neither of its meanings does "body" refer to the concept of the 
liquid state. The word body is therefore likely to make reference to the concept of physics 
(axis A1) as well as to the concept of printing (axis A3). Consequently, it is assigned a 
position in the conceptual reference ℜ whose coordinates are (1, 0, 1). 

[0033] Now let us take the word "plunge", which can mean, specifically, "to cause to 
enter into a liquid", according to the dictionary Le Petit Robert. This word is therefore 
capable of making reference to the concept of liquid state (axis A2) but in none of its 
senses does it refer to the concept of physics (axis A1) or to the concept of printing (axis A3). 
Consequently, the word "plunge" is assigned a position in the conceptual reference ℜ 
whose coordinates are (0, 1, 0). 

[0034] Table 1 contains the coordinates of the positions of all the words in the 
dictionary, determined according to the steps that have just been detailed for two individual 
examples. 

Table 1 

Words        Coordinates 
             A1   A2   A3 
body          1    0    1 
plunge        0    1    0 
liquid        1    1    0 
undergo       0    0    0 
thrust        1    0    0 
vertical      0    0    0 
police        0    0    1 
think         0    0    0 
drowning      0    1    0 
style         0    0    1 
fluid         1    1    0 
idea          0    0    0 
miss          0    0    0 
mechanics     1    0    0 
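
The dictionary of Table 1 can be pictured as a simple mapping from each word to its triplet of coordinates. The sketch below is only one possible representation; the names `AXES`, `DICTIONARY` and `position_of` are assumptions introduced for illustration.

```python
# Axis order follows the description: A1 = physics, A2 = liquid state, A3 = printing.
AXES = ("physics", "liquid state", "printing")

# Word -> position (A1, A2, A3) in the conceptual reference, per Table 1.
DICTIONARY = {
    "body": (1, 0, 1),
    "plunge": (0, 1, 0),
    "liquid": (1, 1, 0),
    "undergo": (0, 0, 0),
    "thrust": (1, 0, 0),
    "vertical": (0, 0, 0),
    "police": (0, 0, 1),
    "think": (0, 0, 0),
    "drowning": (0, 1, 0),
    "style": (0, 0, 1),
    "fluid": (1, 1, 0),
    "idea": (0, 0, 0),
    "miss": (0, 0, 0),
    "mechanics": (1, 0, 0),
}


def position_of(word):
    """Return the position of a dictionary word, or None if the word is absent."""
    return DICTIONARY.get(word)


print(position_of("body"), position_of("plunge"))
```

A word whose coordinates are all zero, such as "undergo", refers to none of the three concepts and therefore contributes nothing to a resultant.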



1.2 - Global conceptualization of the texts to be stored 

[0035] In the instructional example of the description, there are three texts to be stored, as follows: 
Text 1: "Any body plunged into a liquid undergoes a vertical thrust." 
Text 2: "The police think this was a drowning." 
Text 3: "The style is fluid but ideas are missing." 

[0036] In a preliminary step, a syntactic analysis is made of each text to be stored in 
order to extract the conceptual words. 

[0037] Thanks to the extraction of the conceptual words, words that make only a 
minor contribution to the global sense of the text, such as pronouns, articles, prepositions, etc., 
are eliminated from the next stage of text "vectorizing". 

[0038] To illustrate this extraction step, let us apply it to text 1. After the analysis of 
this text and the extraction of conceptual words, the following conceptual words are obtained: 
body, plunged, liquid, undergoes, thrust and vertical. 

[0039] The inflected conceptual words (in other words, the conjugated verbs, 
adjectives in agreement, plural nouns, etc.) are then transformed into their non-inflected 
form. 

[0040] The conceptual words extracted from texts 1, 2 and 3, and transformed, if 
necessary, into their non-inflected form, are detailed in table 2. 
Table 2 

Texts   Words extracted 
1       body, plunge, liquid, undergo, thrust, vertical 
2       police, think, drowning 
3       style, fluid, idea, miss, mechanics 
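
The extraction and de-inflection steps can be sketched as follows. The specification relies on a syntactic analysis that it does not detail; the hand-written stop list `STOP_WORDS` and lemma table `LEMMAS` below are hypothetical stand-ins for that analysis, sufficient only for texts 1 and 2 of the example.

```python
import re

# Hypothetical stand-in for syntactic analysis: words treated as non-conceptual.
STOP_WORDS = {"any", "into", "a", "the", "this", "was", "is", "but", "are"}

# Hypothetical stand-in for de-inflection: inflected form -> non-inflected form.
LEMMAS = {
    "plunged": "plunge",
    "undergoes": "undergo",
    "ideas": "idea",
    "missing": "miss",
}


def conceptual_words(text):
    """Return the non-inflected conceptual words of a text, in text order."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return [LEMMAS.get(token, token) for token in tokens if token not in STOP_WORDS]


print(conceptual_words("Any body plunged into a liquid undergoes a vertical thrust."))
print(conceptual_words("The police think this was a drowning."))
```

The words come out in text order, which may differ from the order listed in table 2; the order has no effect on a vectorial sum.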



[0041] For each text to be stored, the position of each of the conceptual words of this 
text is determined by comparing each of these conceptual words to those of the dictionary in 
which the words are each associated with a position in the reference ℜ. 

[0042] In case a conceptual word in the text and a dictionary word are identical, the 
position in the reference ℜ associated with this word is read in the dictionary, and this 
position is assigned to the conceptual word in the text. The positions thus determined for the 
conceptual words extracted from texts 1 to 3 are as indicated in table 1. 

[0043] Then, for each text to be stored, the resultant of the positions in the reference 
ℜ of all the conceptual words of the text is determined by multiplexing these positions by 
means of a composition algorithm. This algorithm consists here in finding the vectorial sum 
of the positions of all the conceptual words of the text to be stored, that is, adding up the 
homologous coordinates of the positions of the conceptual words of the text. 

[0044] Then, the resultant of the positions of all the conceptual words of the text to be 
stored is normalized and the position of a global conceptualization of this text in the 
reference ℜ is obtained. 

[0045] By definition, a vector is normalized when its norm is equal to one. The step 
seeking to "normalize" a vector therefore consists in dividing this vector by its own norm. 

[0046] The mathematical formula for determining the global conceptualization 
position of the index j text is therefore: 

$$T_j = \sum_{i=1}^{N_j} m_i^j, \qquad t_j = \frac{T_j}{\|T_j\|}$$

in which 

- $m_i^j$ represents the vector of the index i conceptual word of the index j text, 

- $T_j$ represents the resultant of the positions of all the conceptual words of the index j 
text, and 

- $t_j$ represents the global conceptualization vector of the index j text, 

with natural integer i varying between 1 and $N_j$ ($N_j$ representing the total number of 
conceptual words of the index j text), and natural integer j varying between 1 and 3. 
[0047] The global conceptualization vector tj of the index j text constitutes a vectorial 
representation, in the conceptual reference ℜ, of the overall meaning of the index j text. 

[0048] The coordinates of global conceptualization vectors t1, t2, t3 of texts 1, 2 
and 3, respectively, are listed in table 3. 
Table 3 

Text j    Resultant Tj    Global conceptualization vector tj 
Text 1    (3, 2, 1)       (0.802, 0.535, 0.267) 
Text 2    (0, 1, 1)       (0, 0.707, 0.707) 
Text 3    (2, 1, 1)       (0.816, 0.408, 0.408) 



[0049] Finally, the global conceptualization positions of texts 1, 2 and 3 are stored. 
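
The storage steps just described (dictionary lookup, vectorial sum, normalization) can be sketched as below, starting from the word lists of table 2. The names `resultant` and `normalize` are illustrative assumptions; the dictionary and word lists are copied from tables 1 and 2.

```python
import math

DICTIONARY = {
    "body": (1, 0, 1), "plunge": (0, 1, 0), "liquid": (1, 1, 0),
    "undergo": (0, 0, 0), "thrust": (1, 0, 0), "vertical": (0, 0, 0),
    "police": (0, 0, 1), "think": (0, 0, 0), "drowning": (0, 1, 0),
    "style": (0, 0, 1), "fluid": (1, 1, 0), "idea": (0, 0, 0),
    "miss": (0, 0, 0), "mechanics": (1, 0, 0),
}

# Conceptual words per text, as listed in table 2.
TABLE_2 = {
    1: ["body", "plunge", "liquid", "undergo", "thrust", "vertical"],
    2: ["police", "think", "drowning"],
    3: ["style", "fluid", "idea", "miss", "mechanics"],
}


def resultant(words):
    """Vectorial sum of the dictionary positions of the given words."""
    return tuple(sum(coords) for coords in zip(*(DICTIONARY[w] for w in words)))


def normalize(vector):
    """Divide a vector by its Euclidean norm (undefined for the zero vector)."""
    length = math.sqrt(sum(c * c for c in vector))
    if length == 0:
        raise ValueError("cannot normalize the zero vector")
    return tuple(c / length for c in vector)


for j, words in TABLE_2.items():
    T = resultant(words)
    t = tuple(round(c, 3) for c in normalize(T))
    print(j, T, t)
```

Rounded to three decimals, the printed vectors can be compared against table 3.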
2 - Searching among the many stored texts for those that deal with a particular subject 

[0050] At this point, the goal is to search among the stored texts (texts 1, 2 and 3) 
for those that deal with a specific subject which, here, is "fluid mechanics". 

[0051] As for the storage of texts, a syntactic analysis is made of the words of the 
question in order to extract the conceptual words which, in this case, are "mechanics" and 
"fluid". 

[0052] In the event the question contains inflected conceptual words, these words can 
be transformed into their non-inflected form. 

[0053] Each of the conceptual words of the question is compared to those of the 
dictionary in order to determine their position in the conceptual reference ℜ. The respective 
positions of the word "mechanics" and of the word "fluid" are indicated in table 1. 

[0054] Then, the resultant Q of the positions of all the conceptual words of the 
question is determined by multiplexing the positions of the conceptual words of the question 
using the composition algorithm utilized for storing texts. Finally, the resultant Q is 
normalized in order to obtain the global conceptualization vector q of the question. 
[0055] The coordinates of vectors Q and q are, respectively, (2, 1, 0) and (0.894, 
0.447, 0). 

[0056] Then, the global conceptualization position of the question is compared to the 
homologous global conceptualization positions of the stored texts in order to retain at least 
one of them, corresponding to a text looked for. This comparison consists in calculating, for 
each index j text stored (with natural integer j equal to 1, 2 or 3), the distance Dj between the 
two respective positions of the question and of the text. 

[0057] The distance Dj between the global conceptualization vector q of the question 
and the global conceptualization vector tj of the index j text stored is calculated here using 
the following formula: 

$$D_j = 1 - \langle t_j, q \rangle$$

[0058] It should be noted that calculation of the distance Dj uses the scalar product of 
vector tj of the index j text and vector q of the question ($\langle t_j, q \rangle$). 

[0059] Calculation of the distance Dj between the respective positions of the question 
and of each of the index j texts stored (with j equal to 1, 2 or 3) makes it possible to evaluate 
the similarity between the question and each of the stored texts. 

[0060] The results of these distance calculations are indicated in table 4. 
Table 4 

                     Distance Dj 
text 1 / question    0.044 
text 2 / question    0.688 
text 3 / question    0.088 
[0061] Based on these results, the most pertinent text, which is the one for which the 
distance Dj is the shortest, is text 1, which indeed corresponds to the actual situation. 

[0062] It should be stressed that text 1 is determined to be more pertinent than text 3, 
despite the presence in the latter of the term "fluid". 
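
The search step can be sketched as follows, under the assumption that the distance is one minus the scalar product of the two normalized vectors; that assumption is consistent with the description of the distance as using the scalar product and with the values of table 4 for texts 1 and 3. The resultants are taken from table 3 and paragraph [0055]; the function names are illustrative.

```python
import math


def normalize(vector):
    """Divide a vector by its Euclidean norm."""
    length = math.sqrt(sum(c * c for c in vector))
    return tuple(c / length for c in vector)


def distance(t, q):
    """Assumed distance: one minus the scalar product of two normalized vectors."""
    return 1.0 - sum(a * b for a, b in zip(t, q))


# Resultants of the stored texts (table 3) and of the question "fluid mechanics".
RESULTANTS = {1: (3, 2, 1), 2: (0, 1, 1), 3: (2, 1, 1)}
q = normalize((2, 1, 0))

distances = {j: distance(normalize(T), q) for j, T in RESULTANTS.items()}
best = min(distances, key=distances.get)

for j, d in sorted(distances.items()):
    print(f"text {j} / question: {d:.3f}")
print("most pertinent text:", best)
```

Because this sketch works at full floating-point precision rather than from the rounded coordinates of table 3, individual distances may differ from the tabulated ones in the third decimal; the ranking of the three texts is unaffected.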

[0063] In the preceding description, the global conceptualization vector of a text or of 
the question, is the normalized resultant of the positions of all the conceptual words of this 
text or of the question. It would also be possible to envisage defining the global 
conceptualization vector of a text or of a question as the non-normalized resultant of the 
positions of all the conceptual words of this text or of this question. 

[0064] The formula for calculating the distance Dj between the respective positions of 
the question and an index j stored text would therefore be as follows: 

$$D_j = 1 - \frac{\langle Q, T_j \rangle}{\|Q\| \, \|T_j\|}$$

in which 

- Q represents the global conceptualization vector of the question and 

- Tj represents the global conceptualization vector of the index j text. 

[0065] Indeed, in this case, the resultant of the positions of the conceptual words is 
normalized by calculating the distance between the respective global conceptualization 
positions of the text and of the question. 
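
A minimal sketch of this variant follows, assuming the distance is one minus the scalar product divided by the product of the two norms, so that normalization happens inside the distance calculation. The function name `distance_raw` is an illustrative assumption.

```python
import math


def dot(x, y):
    """Euclidean scalar product."""
    return sum(a * b for a, b in zip(x, y))


def distance_raw(Q, T):
    """Assumed distance between non-normalized resultants Q and T."""
    return 1.0 - dot(Q, T) / (math.sqrt(dot(Q, Q)) * math.sqrt(dot(T, T)))


# Non-normalized resultants of the question and of text 1.
Q, T1 = (2, 1, 0), (3, 2, 1)
print(round(distance_raw(Q, T1), 3))

# Scaling either resultant leaves the distance unchanged.
print(round(distance_raw((4, 2, 0), T1), 3))
```

Since the division by the norms cancels any scaling, storing the non-normalized resultants leads to the same ranking of texts as storing the normalized vectors.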

[0066] In a variant which differs from the detailed description above only in terms of 
what will now be described, the multidimensional vectorial space is given a non-Euclidean 
scalar product and an associated non-Euclidean norm. 

[0067] The non-Euclidean scalar product of two vectors X and Y is defined by the 
following formula: 

$$\langle X, Y \rangle = \sum_{i=1}^{n} k_i x_i y_i$$

[0068] The norm associated with vector X is defined by the following formula: 

$$\|X\| = \sqrt{\sum_{i=1}^{n} k_i x_i^2}$$

in which 

- $x_i$ and $y_i$ represent the respective coordinates of vector X and of vector Y along 
the index i axis of the conceptual reference and 

- $k_i$ represents a weighting coefficient relative to the index i axis, 

with natural integer i varying between 1 and n, n representing the dimension of the 
vectorial space. 

[0069] The coefficient ki is fixed in relation to the index i axis in terms of the 
importance of the concept expressed by this axis in the conceptual reference. 
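
The weighted variant can be sketched as below, assuming the scalar product weights each coordinate product by its axis coefficient and that all coefficients are strictly positive. The weights `K` are hypothetical values chosen only for illustration; the specification gives no numerical coefficients.

```python
import math


def weighted_scalar_product(x, y, k):
    """Scalar product in which axis i is weighted by coefficient k[i]."""
    return sum(ki * xi * yi for ki, xi, yi in zip(k, x, y))


def weighted_norm(x, k):
    """Norm associated with the weighted scalar product."""
    return math.sqrt(weighted_scalar_product(x, x, k))


def weighted_distance(Q, T, k):
    """Assumed distance: one minus the weighted cosine of Q and T."""
    return 1.0 - weighted_scalar_product(Q, T, k) / (
        weighted_norm(Q, k) * weighted_norm(T, k)
    )


# Hypothetical weights: physics counts double, printing counts half.
K = (2.0, 1.0, 0.5)
print(round(weighted_distance((2, 1, 0), (3, 2, 1), K), 3))
```

With all coefficients equal to one, the functions reduce to the Euclidean case of the first form of embodiment.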

[0070] In this variant, in order to search among a number of stored texts for those that are 
pertinent with respect to a question, the global conceptualization positions of the question 
and of the stored texts are compared, and for each text, the distance between the two 
respective positions of the question and of the text is determined using the distance 
calculation formula specified in the first form of embodiment of the search procedure 
described, and using the non-Euclidean scalar product as defined above. 

[0071] In a second form of embodiment of the text storage procedure, which differs 
from the first form of embodiment described only in terms of what will now be described, for 
each text to be stored, one first associates to the position Pℜ in reference ℜ of each 
conceptual word of this text its position in the text Pt as well as its syntactic role Rsynt in the 
text, in order to form, for each conceptual word extracted from the text, a triplet (Pℜ, Pt, 
Rsynt) containing the position Pℜ in reference ℜ of the word, its position Pt in the text and its 
syntactic role Rsynt. 

[0072] For each text to be stored, the resultant of the positions of the conceptual 
words of the text is determined by multiplexing the triplets of all the conceptual words of the 
text by a composition algorithm, in order to determine the position of the global 
conceptualization of this text. 
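
The specification does not state how the triplets are multiplexed. Purely as an illustration of the data involved, the sketch below represents each triplet as a small record and combines them with a hypothetical rule that weights each word by its syntactic role. The class `WordTriplet`, the table `ROLE_WEIGHTS` and the function `compose` are all assumptions, not the composition algorithm of the invention.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class WordTriplet:
    position: tuple   # position of the word in the conceptual reference
    text_index: int   # position of the word in the text
    role: str         # syntactic role of the word in the text


# Hypothetical weights per syntactic role; unknown roles default to 1.0.
ROLE_WEIGHTS = {"subject": 2.0, "verb": 1.0, "object": 1.5}


def compose(triplets):
    """Hypothetical composition: role-weighted vectorial sum of the positions.

    The text index is carried in each triplet but not used by this simple rule.
    """
    total = [0.0] * len(triplets[0].position)
    for triplet in triplets:
        weight = ROLE_WEIGHTS.get(triplet.role, 1.0)
        for axis, coordinate in enumerate(triplet.position):
            total[axis] += weight * coordinate
    return tuple(total)


triplets = [
    WordTriplet(position=(1, 0, 1), text_index=1, role="subject"),
    WordTriplet(position=(0, 1, 0), text_index=2, role="verb"),
]
print(compose(triplets))
```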

[0073] In order to search among the texts stored according to this storage procedure, 
for those that deal with a question, the position of the global conceptualization of the 
question is determined. To do this, as for the storage of texts, the resultant of the positions of 
conceptual words of the question is determined by associating each conceptual word of the 
question with a triplet containing the position of this word in the reference ℜ, its position in 
the question and its syntactic role in the question and by multiplexing these triplets by means 
of the composition algorithm used for the storage. 
[0074] The position of the global conceptualization of the question is then compared 
to the homologous positions of the stored texts, by calculating the distance between these 
positions. From this is deduced the similarity between the question and the stored texts and, 
therefore, the most pertinent texts that deal with the question. 

[0075] In a third form of embodiment of the text storage procedure, which differs 
from the first form of embodiment described only in terms of what will now be described, the 
text is broken up into a number of segments. Each segment initially contains a predefined 
number of conceptual words, five in this case, that are close to one another in the text. 

[0076] Two segments are referred to as "close" or "neighboring" when they are side 
by side in the text or separated from one another only by non-conceptual words. 

[0077] The positions in the conceptual reference of all the conceptual words of the 
text are determined. For each text segment, the resultant of the positions of all the conceptual 
words of this segment is determined by multiplexing these positions by means of the 
composition algorithm utilized in the first form of embodiment of the storage procedure 
described. This resultant is then normalized in order to obtain the global conceptualization 
position of the segment in the conceptual reference. 

[0078] The global conceptualization positions of the neighboring segments in the text 
are then compared two by two by calculating, for each pair of neighboring segments, the 
distance between the two respective conceptualization positions of the two segments, using 
the calculation formula of the distance specified in the first form of embodiment of the search 
procedure. 

[0079] If the distance between the respective global conceptualization positions of 
two neighboring segments is under a predefined threshold, in other words, if these two 
segments have close meanings, these two segments are combined to form a new segment 
whose global conceptualization position is then determined. 

[0080] On the other hand, if the distance between the global conceptualization 
positions of two neighboring segments is above the predefined threshold, in other words, if 
these two segments have unrelated meanings, the two segments are not combined. 

[0081] The step that consists in combining the neighboring segments is repeated until 
they can no longer be combined. The iterative regrouping of segments delimits a number of 
text portions that are such that the distance between the respective global conceptualization 
positions of two neighboring text portions is over the predefined threshold. In other words, 
the global meaning of each part of the text is quite removed from the global meaning of a 
neighboring part. 
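
The iterative regrouping can be sketched as follows. Each segment is represented as a list of word positions; the distance is assumed to be one minus the scalar product of the normalized resultants, and a segment whose resultant is the zero vector is given a zero conceptualization vector, a convention adopted here only to avoid dividing by zero. The function names and the threshold value are illustrative assumptions.

```python
import math


def concept(vectors):
    """Normalized vectorial sum of a segment's word positions."""
    total = [sum(coords) for coords in zip(*vectors)]
    length = math.sqrt(sum(c * c for c in total))
    if length == 0:
        return tuple(0.0 for _ in total)  # convention for purely non-conceptual sums
    return tuple(c / length for c in total)


def distance(a, b):
    """Assumed distance: one minus the scalar product of normalized vectors."""
    return 1.0 - sum(x * y for x, y in zip(a, b))


def merge_segments(segments, threshold):
    """Merge neighboring segments until every neighboring pair is at least threshold apart."""
    segments = [list(segment) for segment in segments]
    merged = True
    while merged:
        merged = False
        for i in range(len(segments) - 1):
            if distance(concept(segments[i]), concept(segments[i + 1])) < threshold:
                # Combine the pair into one new segment and start over.
                segments[i:i + 2] = [segments[i] + segments[i + 1]]
                merged = True
                break
    return segments


# Three short segments (fewer than five words each, for brevity).
segments = [
    [(1, 0, 0), (1, 1, 0)],
    [(1, 0, 0), (1, 1, 0)],
    [(0, 0, 1)],
]
portions = merge_segments(segments, threshold=0.2)
print(len(portions), [len(p) for p in portions])
```

In this example the first two segments have the same meaning and are combined, while the third, which refers only to printing, remains a separate portion.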

[0082] To compare a question and a stored text containing a number of portions each 
represented by its global conceptualization position in the conceptual reference, the position 
of each of the text portions is compared to the position of the question, by calculating the 
distance between these positions. A text is considered to be pertinent if the distance between 
the position of one of its portions and the position of the question is short. 

[0083] Of course, the question could be broken down into a number of portions each 
represented by its global conceptualization position. 

[0084] In this case, the vectors of the portions of a stored text and those of the 
portions of the question would be compared two by two. The text is considered to be 
pertinent if the distance between the position of one of its portions and the position of one of 
the portions of the question is short. 
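
The pairwise comparison of portions can be sketched as follows, assuming normalized portion vectors and the distance taken as one minus the scalar product. The description says only that the distance must be "short"; the numerical `threshold` used below is therefore a hypothetical parameter.

```python
def distance(a, b):
    """Assumed distance: one minus the scalar product of normalized vectors."""
    return 1.0 - sum(x * y for x, y in zip(a, b))


def best_portion_distance(text_portions, question_portions):
    """Smallest distance over all (text portion, question portion) pairs."""
    return min(
        distance(t, q) for t in text_portions for q in question_portions
    )


def is_pertinent(text_portions, question_portions, threshold):
    """A text is pertinent if at least one pair of portions is close enough."""
    return best_portion_distance(text_portions, question_portions) <= threshold


# A text with one portion about printing and one about physics,
# compared to a single-portion question about physics.
text_portions = [(0.0, 0.0, 1.0), (1.0, 0.0, 0.0)]
question_portions = [(1.0, 0.0, 0.0)]

print(best_portion_distance(text_portions, question_portions))
print(is_pertinent(text_portions, question_portions, threshold=0.1))
```

Taking the minimum over pairs means that one well-matching portion is enough for the whole text to be retained, however unrelated its other portions may be.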

[0085] It should be noted that in the third form of embodiment of the storage 
procedure, each of the portions of a text is stored in the same way that a text (consisting of 
only one portion) is stored in the first mode of the storage procedure. Finally, a "text" and a 
"text portion" are two equivalent word sets. 

[0086] Concerning the composition algorithm for determining the resultant of 
conceptual word positions of a text, a text segment or a question, it is also possible, instead of 
only finding the vectorial sum of the positions of the conceptual words of the text, text 
segment or question, to amplify the values of the strongest coordinates of the vector resulting 
from the vectorial sum of the positions of the conceptual words, for example by multiplying 
them by a predefined coefficient. In this way, the importance of the most important concepts 
is further amplified to the detriment of the less important concepts, in order to prevent any 
possible ambiguity when comparing the global conceptualization vectors of a text and of a 
question. Indeed, the interference due to the weak coordinate values of the 
conceptualization vectors is thereby reduced. 

[0087] To illustrate this variant, let us apply it to text 1. By finding the vectorial sum 
of the positions of all the conceptual words of this text, the vector (3, 2, 1) is obtained. In 
order to obtain the resultant of the positions of all the conceptual words of text 1, the 
strongest coordinates, which are those along axes A1 and A2, are multiplied by a coefficient 
which here is equal to 2. The resultant of text 1 is therefore the vector (6, 4, 1). 
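
The amplification step can be sketched as below. The description does not say how the "strongest" coordinates are selected; this sketch assumes that a fixed number of the largest coordinates (`top_k`, a hypothetical parameter) is multiplied by the coefficient.

```python
def amplify(vector, coefficient=2, top_k=2):
    """Multiply the top_k largest coordinates of a vector by a coefficient.

    The top_k selection rule is an assumption made for this illustration.
    """
    strongest = sorted(
        range(len(vector)), key=lambda axis: vector[axis], reverse=True
    )[:top_k]
    return tuple(
        value * coefficient if axis in strongest else value
        for axis, value in enumerate(vector)
    )


# Vectorial sum of text 1, amplified along its two strongest axes.
print(amplify((3, 2, 1)))
```

Other selection rules, such as amplifying every coordinate above a given value, would fit the description equally well.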

[0088] In the instructional example described above, the question "fluid mechanics" 
contained few words. Obviously, one could take a question containing many more words, 
which could even constitute a text. 

[0089] In practice, the conceptual reference ℜ includes several hundred dimensions, 
and the dictionary contains several thousand words. 






WHAT IS CLAIMED IS: 

1. Text storage procedure (1) according to which: 

- a dictionary of words is created in a multidimensional conceptual reference, 

- each conceptual word of at least one portion of the text to be stored (1) is compared 
to those of the dictionary in order to determine the position of this word in said 
reference and 

- the resultant (Tj) of the positions of all the conceptual words of the text portion to 
be stored (1) is determined in order to determine the position of a global 
conceptualization of the text portion (1) in said reference and to store that position. 

2. Procedure according to claim 1, in which, to determine the resultant of the 
positions in the reference of all the conceptual words of the text portion to be stored, each 
word position in the reference is first associated with its position in the text and its syntactic 
role. 

3. Procedure according to claim 1, in which, to determine the resultant (T1) of the 
positions of the conceptual words of the text portion to be stored (1), these positions are 
multiplexed using a composition algorithm. 

4. Procedure according to claim 3, in which the composition algorithm consists in 
finding the vectorial sum of the positions of all the conceptual words of the text portion to be 
stored (1). 

5. Procedure according to claim 4, in which the composition algorithm also consists 
in amplifying the importance of the most important concepts. 

6. Procedure according to claim 1, in which the resultant (T1) of the positions of all 
the conceptual words of the text portion to be stored (1) is normalized. 

7. Procedure according to claim 1, in which the multidimensional conceptual 
reference is made orthonormal. 

8. Procedure according to claim 1, in which, for each word to be included in the 
dictionary, all the concepts related to the conceptual reference to which this word is likely to 
make reference are searched and, in terms of these concepts, the word is assigned a position 
in the conceptual reference. 

9. Procedure according to claim 1, in which a syntactic analysis of all the words of 
the text portion (1) is made in order to extract the conceptual words. 

10. Procedure according to claim 1, in which the inflected words of the text portion 
to be stored (1) are transformed into the non-inflected form. 

11. Procedure for storage of a text containing a number of text portions in which each 
text portion is stored according to the procedure per claim 1. 

12. Procedure according to claim 11, in which the text is broken up into a number of 
segments whose respective global conceptualization positions in the conceptual reference are 
determined, and the respective global conceptualization positions of the neighboring segments 
in the text are compared in order to delimit the text portions. 

13. Procedure according to claim 11, in which, in order to compare the respective 
global conceptualization positions of two neighboring segments of the text, the distance 
between these positions is determined and, in the event this distance is under a predefined 
threshold, the two segments are combined to form a new segment. 

14. Procedure according to claim 13, in which the text portions are formed by 
iterative groupings of segments. 
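
By way of illustration only, the iterative grouping of neighboring segments may be sketched as follows. The sketch assumes that the resultant of a segment is the plain vectorial sum of its word positions and that the distance is one minus the scalar product divided by the product of the norms; the function names, the sample segments and the threshold value are hypothetical, and every resultant is assumed to be non-zero.

```python
import math


def resultant(positions):
    """Vectorial sum of the word positions of one segment."""
    return [sum(coords) for coords in zip(*positions)]


def distance(x, y):
    """1 - <X, Y> / (|X| |Y|); assumes neither vector is the zero vector."""
    dot = sum(a * b for a, b in zip(x, y))
    norm_x = math.sqrt(sum(a * a for a in x))
    norm_y = math.sqrt(sum(b * b for b in y))
    return 1 - dot / (norm_x * norm_y)


def group_segments(segments, threshold):
    """Merge neighboring segments until no adjacent pair is closer than threshold."""
    segments = [list(s) for s in segments]
    merged = True
    while merged:
        merged = False
        for i in range(len(segments) - 1):
            left, right = resultant(segments[i]), resultant(segments[i + 1])
            if distance(left, right) < threshold:
                # Combine the two neighbors into a new segment, then rescan.
                segments[i:i + 2] = [segments[i] + segments[i + 1]]
                merged = True
                break
    return segments


# Hypothetical segments, each given as a list of word positions.
segments = [[(1, 0, 0)], [(2, 0, 0)], [(0, 1, 0)], [(0, 3, 0)]]
portions = group_segments(segments, threshold=0.1)
```

With these sample values the first two segments and the last two segments are grouped, which leaves two text portions.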

15. Procedure for searching among a number of stored texts according to the storage 
procedure of claim 1 for those that deal with a particular question, in which: 

- as for any text storage, the position in the multidimensional conceptual reference of 
a global conceptualization of the question is determined by determining the resultant 
(Q) of the positions of all the conceptual words of the question and 

- the position of the global conceptualization of the question is compared to the 
homologous positions of the stored texts in order to select at least one of them 
corresponding to a searched text. 

16. Procedure according to claim 15, in which the positions of the global 
conceptualizations of the question and of the stored texts are compared by determining, for 
each text, the distance between the two respective positions of the question and of the text. 

17. Procedure according to claim 15, in which calculation of the distance between 
two positions in the conceptual reference utilizes the scalar product of these positions. 

18. Procedure according to claim 17, in which the distance between two positions 
in the conceptual reference is calculated using the following formula: 



$$D = 1 - \frac{\langle X, Y \rangle}{\|X\| \, \|Y\|}$$

in which 

- X and Y represent the two positions, 

- D represents the distance between the two positions X and Y, 

- <X, Y> represents the scalar product of X and of Y, and 

- ||X|| and ||Y|| represent the respective norms of X and of Y. 
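
By way of illustration only, a distance equal to one minus the scalar product of the two positions divided by the product of their norms may be computed as sketched below. The function names and the sample positions are hypothetical, and both positions are assumed to be non-zero.

```python
import math


def scalar_product(x, y):
    """Ordinary scalar product of two positions of equal dimension."""
    return sum(a * b for a, b in zip(x, y))


def norm(x):
    """Norm induced by the scalar product."""
    return math.sqrt(scalar_product(x, x))


def distance(x, y):
    """D = 1 - <X, Y> / (|X| |Y|); assumes non-zero positions."""
    return 1 - scalar_product(x, y) / (norm(x) * norm(y))


text_position = (6, 4, 1)      # resultant used in the worked example
question_position = (1, 1, 0)  # hypothetical question position

d_question = distance(question_position, text_position)
d_identical = distance(text_position, text_position)   # approximately 0
d_orthogonal = distance((1, 0, 0), (0, 1, 0))           # 1.0
```

The distance depends only on the direction of the two positions, so normalizing the resultants beforehand does not change the result.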

19. Procedure according to claim 15, in which the distance determined between two 
positions is non-Euclidean. 

20. Procedure according to claim 19, in which the distance determined between 
two positions uses the scalar product defined by the following formula: 

$$\langle X, Y \rangle = \sum_{i=1}^{n} k_i \, x_i \, y_i$$

in which 

- <X, Y> represents the scalar product of two positions X and Y, 

- n, a natural integer, represents the dimension of the conceptual reference containing 
n index i axes with a natural integer i varying between 1 and n, 

- x_i and y_i represent the respective coordinates of the positions X and Y along the 
index i axis and 

- k_i represents a weighting coefficient relative to the index i axis. 
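
By way of illustration only, and assuming that the scalar product is the sum over the n axes of the products of corresponding coordinates, each weighted by the coefficient of its axis (as the definitions listed above suggest), the computation may be sketched as follows. The function name and the sample weighting coefficients are hypothetical.

```python
def weighted_scalar_product(x, y, k):
    """Sum over the axes i of k[i] * x[i] * y[i]."""
    if not (len(x) == len(y) == len(k)):
        raise ValueError("x, y and k must have the same dimension n")
    return sum(ki * xi * yi for ki, xi, yi in zip(k, x, y))


weights = (1.0, 0.5, 2.0)  # hypothetical weighting coefficients, one per axis
value = weighted_scalar_product((6, 4, 1), (1, 1, 0), weights)  # 6 + 2 + 0 = 8.0

# With every coefficient equal to 1, the ordinary scalar product is recovered.
unweighted = weighted_scalar_product((6, 4, 1), (1, 1, 0), (1, 1, 1))  # 10
```

Positive coefficients keep the product a valid scalar product, so the norm and distance derived from it remain well defined.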

21. Procedure according to claim 15, in which the resultant (Q) of the positions of all 
the conceptual words of the question is normalized. 

22. Procedure according to claim 15, in which a syntactic analysis is made of all the 
words of the question in order to extract the conceptual words. 

24. [sic] Procedure according to claim 15, in which the inflected words of the 
question are transformed into their non-inflected form. 

ABSTRACT 

In a multidimensional conceptual reference, a dictionary of words is created, each 
conceptual word of at least one portion of the text to be stored is compared to those of the 
dictionary to determine the position of this word in said reference, and the resultant (T1) of 
the positions of all the conceptual words of the text portion to be stored is determined in 
order to determine the position of a global conceptualization of the text portion in said 
reference and to store this position. The position in a multidimensional conceptual reference 
of a global conceptualization of the question is determined, and the position of the global 
conceptualization of the question is compared to the homologous positions of the stored 
texts, in order to select at least one of them, corresponding to a searched text. 



